
SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Gehrmann, Mathias 
Seemann , Gerhard 
Boss let, Klaus 
Czech, Joerg 

(ii) TITLE OF INVENTION: Fusion Protein for Prodrug Activity 
(iii) NUMBER OF SEQUENCES: 18 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Finnegan, Henderson, Farabow, Garrett & 

Dunner 

(B) STREET: 1300 I Street, N.W. 

(C) CITY: Washington 

(D) STATE: D.C. 

(E) COUNTRY: USA 

(F) ZIP: 20005-3315 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/129,379 

(B) FILING DATE: 30-SEP-1993 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: DE P 42 33 152.8 

(B) FILING DATE: 02-OCT-1992 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Barker, M. P. 

(B) REGISTRATION NUMBER: 32,013 

(C) REFERENCE /DOCKET NUMBER: 02481.1337-00000 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-408-4000 

(B) TELEFAX: 202-408-4400 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3314 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 



(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 145.. 283 

( ix ) FEATURE : 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: join(284. .1003, 1069.. 1119, 1263.. 3161) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: join( 145 . . 189 , 272.. 1003, 1069.. 1119, 1263.. 3161) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

CCAAGCTTAT GAATATGCAA ATCCTGCTCA TGAATATGCA AATCCTCTGA ATCTACATGG 60 

TAAATATAGG TTTGTCTATA CCACAAACAG AAAAACATGA GATCACAGTT CTCTCTACAG .120 

TTACTGAGCA CACAGGACCT CACC ATG GGA TGG AGC TGT ATC ATC CTC TTC 171 

Met Gly Trp Ser Cys lie lie Leu Phe 
-19 -15 

TTG GTA GCA AC A GCT ACA GGTAAGGGGC TCACAGTAGC AGGCTTGAGG 219 
Leu val Ala Thr Ala Thr 
-10 -5 

TCTGGACATA TATATGGGTG ACAATGACAT CCACTTTGCC TTTCTCTCCA CA GGT 274 

Gly 
-4 

GTC CAC TCC CAG GTC CAA CTG CAG GAG AGC GGT CCA GGT CTT GTG AGA 322 
Val His Ser Gin Val Gin Leu Gin Glu Ser Gly Pro Gly Leu Val Arg 
15 10 

CCT AGC CAG ACC CTG AGC CTG ACC TGC ACC GTG TCT GGC TTC ACC ATC 370 
Pro Ser Gin Thr Leu Ser Leu Thr Cys Thr Val Ser Gly Phe Thr He 
15 20 25 

AGC AGT GGT TAT AGC TGG CAC TGG GTG AGA CAG CCA CCT GGA CGA GGT 418 
Ser Ser Gly Tyr Ser Trp His Trp Val Arg Gin Pro Pro Gly Arg Gly 
30 35 40 45 

CTT GAG TGG ATT GGA TAC ATA CAG TAC AGT GGT ATC ACT AAC TAC AAC 466 
Leu Glu Trp He Gly Tyr He Gin Tyr Ser Gly He Thr Asn Tyr Asn 

50 55 60 



CCC TCT CTC AAA AGT AGA GTG ACA ATG CTG GTA GAC ACC AGC AAG AAC 514 
Pro Ser Leu Lys Ser Arg Val Thr Met Leu Val Asp Thr Ser Lys Asn 

65 70 75 

CAG TTC AGC CTG AGA CTC AGC AGC GTG ACA GCC GCC GAC ACC GCG GTC 56 2 

Gin Phe Ser Leu Arg Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Val 
80 85 90 

TAT TAT TGT GCA AGA GAA GAC TAT GAT TAC CAC TGG TAC TTC GAT GTC 610 
Tyr Tyr Cys Ala Arg Glu Asp Tyr Asp Tyr His Trp Tyr Phe Asp Val 
95 100 105 

TGG GGC CAA GGG ACC ACG GTC ACC GTC TCC TCA GGA GGC GGT GGA TCG 65 8 

Trp Gly Gin Gly Thr Thr Val Thr Val Ser Ser Gly Gly Gly Gly Ser 
110 115 120 125 

GGC GGT GGT GGG TCG GGT GGC GGC GGA TCT GAC ATC CAG CTG ACC CAG 706 
Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Asp lie Gin Leu Thr Gin 

130 135 140 

AGC CCA AGC AGC CTG AGC GCC AGC GTG GGT GAC AGA GTG ACC ATC ACC 75 4 

Ser Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr He Thr 
145 150 155 

TGT AGT ACC AGC TCG AGT GTA AGT TAC ATG CAC TGG TAC CAG CAG AAG 80 2 

Cys Ser Thr Ser Ser Ser Val Ser Tyr Met His Trp Tyr Gin Gin Lys 
160 165 170 

CCA GGT AAG GCT CCA AAG CTG CTG ATC TAC AGC ACA TCC AAC CTG GCT 850 
Pro Gly Lys Ala Pro Lys Leu Leu He Tyr Ser Thr Ser Asn Leu Ala 
175 180 185 

TCT GGT GTG CCA AGC AGA TTC AGC GGT AGC GGT AGC GGT ACC GAC TTC 898 
Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe 
190 195 200 205 

ACC TTC ACC ATC AGC AGC CTC CAG CCA GAG GAC ATC GCC ACC TAC TAC 94 6 

Thr Phe Thr He Ser Ser Leu Gin Pro Glu Asp He Ala Thr Tyr Tyr 

210 215 220 

TGC CAT CAG TGG AGT AGT TAT CCC ACG TTC GGC CAA GGG ACC AAG CTG 99 4 

Cys His Gin Trp Ser Ser Tyr Pro Thr Phe Gly Gin Gly Thr Lys Leu 
225 230 235 

GAG ATC AAA GGTGAGTAGA ATTTAAACTT TGCTTCCTCA GTTGGATCTG 104 3 

Glu He Lys 
240 

AGTAACTCCC AATCTTCTCT CTGCA GAG CTC AAA ACC CCA CTT GGT GAC ACA 109 5 

Glu Leu Lys Thr Pro Leu Gly Asp Thr 

245 



ACT CAC ACA TGC CCA CGG TGC CCA GGTAAGCCAG CCCAGGACTC GCCCTCCAGC 1149 
Thr His Thr Cys Pro Arg Cys Pro 
250 255 

TCAAGGCGGG ACAAGAGCCC TAGAGTGGCC TGAGTCCAGG GACAGGCCCC AGCAGGGTGC 1209 

TGACGCATCC ACCTCCATCC CAGATCCCCG TAACTCCCAA TCTTCTCTCT GCA GCG 1265 

Ala 

GCG GCG GCG GTG CAG GGC GGG ATG CTG TAC CCC CAG GAG AGC CCG TCG 1313 
Ala Ala Ala Val Gin Gly Gly Met Leu Tyr Pro Gin Glu Ser Pro Ser 
260 265 270 

CGG GAG TGC AAG GAG CTG GAC GGC CTC TGG AGC TTC CGC GCC GAC TTC 1361 
Arg Glu Cys Lys Glu Leu Asp Gly Leu Trp Ser Phe Arg Ala Asp Phe 
275 280 285 290 

TCT GAC AAC CGA CGC CGG GGC TTC GAG GAG CAG TGG TAC CGG CGG CCG 1409 
Ser Asp Asn Arg Arg Arg Gly Phe Glu Glu Gin Trp Tyr Arg Arg Pro 

295 300 305 

CTG TGG GAG TCA GGC CCC ACC GTG GAC ATG CCA GTT CCC TCC AGC TTC 1457 
Leu Trp Glu Ser Gly Pro Thr Val Asp Met Pro Val Pro Ser Ser Phe 
310 315 320 

AAT GAC ATC AGC CAG GAC TGG CGT CTG CGG CAT TTT GTC GGC TGG GTG 1505 
Asn Asp He Ser Gin Asp Trp Arg Leu Arg His Phe Val Gly Trp Val 
325 330 335 

TGG TAC GAA CGG GAG GTG ATC CTG CCG GAG CGA TGG ACC CAG GAC CTG 1553 
Trp Tyr Glu Arg Glu Val He Leu Pro Glu Arg Trp Thr Gin Asp Leu 
340 345 350 

CGC ACA AGA GTG GTG CTG AGG ATT GGC AGT GCC CAT TCC TAT GCC ATC 1601 
Arg Thr Arg Val Val Leu Arg He Gly Ser Ala His Ser Tyr Ala He 
355 360 365 370 

GTG TGG GTG AAT GGG GTC GAC ACG CTA GAG CAT GAG GGG GGC TAC CTC 1649 
Val Trp Val Asn Gly Val Asp Thr Leu Glu His Glu Gly Gly Tyr Leu 

375 380 385 

CCC TTC GAG GCC GAC ATC AGC AAC CTG GTC CAG GTG GGG CCC CTG CCC 1697 
Pro Phe Glu Ala Asp He Ser Asn Leu Val Gin Val Gly Pro Leu Pro 
390 395 400 

TCC CGG CTC CGA ATC ACT ATC GCC ATC AAC AAC ACA CTC ACC CCC ACC 1745 
ser Arg Leu Arg He Thr He Ala He Asn Asn Thr Leu Thr Pro Thr 
405 410 415 

ACC CTG CCA CCA GGG ACC ATC CAA TAC CTG ACT GAC ACC TCC AAG TAT 1793 
Thr Leu Pro Pro Gly Thr He Gin Tyr Leu Thr Asp Thr Ser Lys Tyr 
420 425 430 



CCC AAG GGT TAC TTT GTC CAG AAC ACA TAT TTT GAC TTT TTC AAC TAC 
Pro Lys Gly Tyr Phe Val Gin Asn Thr Tyr Phe Asp Phe Phe Asn Tyr 
435 440 445 450 

GCT GGA CTG CAG CGG TCT GTA CTT CTG TAC ACG ACA CCC ACC ACC TAC 
Ala Gly Leu Gin Arg Ser Val Leu Leu Tyr Thr Thr Pro Thr Thr Tyr 

455 460 465 



1841 



1889 



ATC GAT GAC ATC ACC GTC ACC ACC AGC GTG GAG CAA GAC AGT GGG CTG 1937 
lie Asp Asp lie Thr Val Thr Thr Ser Val Glu Gin Asp Ser Gly Leu 
470 475 480 

GTG AAT TAC CAG ATC TCT GTC AAG GGC AGT AAC CTG TTC AAG TTG GAA 1985 
Val Asn Tyr Gin lie Ser Val Lys Gly Ser Asn Leu Phe Lys Leu Glu 
485 490 495 

GTG CGT CTT TTG GAT GCA GAA AAC AAA GTC GTG GCG AAT GGG ACT GGG 203 3 

Val Arg Leu Leu Asp Ala Glu Asn Lys Val Val Ala Asn Gly Thr Gly 
500 505 510 

ACC CAG GGC CAA CTT AAG GTG CCA GGT GTC AGC CTC TGG TGG CCG TAC 2081 
Thr Gin Gly Gin Leu Lys Val Pro Gly Val Ser Leu Trp Trp Pro Tyr 
515 520 525 530 

CTG ATG CAC GAA CGC CCT GCC TAT CTG TAT TCA TTG GAG GTG CAG CTG 2129 
Leu Met His Glu Arg Pro Ala Tyr Leu Tyr Ser Leu Glu Val Gin Leu 

535 540 545 

ACT GCA CAG ACG TCA CTG GGG CCT GTG TCT GAC TTC TAC ACA CTC CCT 2177 
Thr Ala Gin Thr Ser Leu Gly Pro Val Ser Asp Phe Tyr Thr Leu Pro 
550 555 560 

GTG GGG ATC CGC ACT GTG GCT GTC ACC AAG AGC CAG TTC CTC ATC AAT 2225 
Val Gly lie Arg Thr Val Ala Val Thr Lys Ser Gin Phe Leu lie Asn 
565 570 575 

GGG AAA CCT TTC TAT TTC CAC GGT GTC AAC AAG CAT GAG GAT GCG GAC 227 3 

Gly Lys Pro Phe Tyr Phe His Gly Val Asn Lys His Glu Asp Ala Asp 
580 585 590 

ATC CGA GGG AAG GGC TTC GAC TGG CCG CTG CTG GTG AAG GAC TTC AAC 2321 
lie Arg Gly Lys Gly Phe Asp Trp Pro Leu Leu Val Lys Asp Phe Asn 
595 600 605 610 

CTG CTT CGC TGG CTT GGT GCC AAC GCT TTC CGT ACC AGC CAC TAC CCC 2369 
Leu Leu Arg Trp Leu Gly Ala Asn Ala Phe Arg Thr Ser His Tyr Pro 

615 620 625 

TAT GCA GAG GAA GTG ATG CAG ATG TGT GAC CGC TAT GGG ATT GTG GTC 2417 
Tyr Ala Glu Glu Val Met Gin Met Cys Asp Arg Tyr Gly He Val Val 
630 635 640 



ATC GAT GAG TGT CCC GGC GTG GGC CTG GCG CTG CCG CAG TTC TTC AAC 
lie Asp Glu Cys Pro Gly Val Gly Leu Ala Leu Pro Gin Phe Phe Asn 
645 650 655 



2465 



AAC GTT TCT CTG CAT CAC CAC ATG CAG GTG ATG GAA GAA GTG GTG CGT 
Asn Val Ser Leu His His His Met Gin Val Met Glu Glu Val Val Arg 
660 665 670 

AGG GAC AAG AAC CAC CCC GCG GTC GTG ATG TGG TCT GTG GCC AAC GAG 
Arg Asp Lys Asn His Pro Ala Val Val Met Trp Ser Val Ala Asn Glu 
675 680 685 690 

CCT GCG TCC CAC CTA GAA TCT GCT GGC TAC TAC TTG AAG ATG GTG ATC 
Pro Ala Ser His Leu Glu Ser Ala Gly Tyr Tyr Leu Lys Met Val He 

695 700 705 

GCT CAC ACC AAA TCC TTG GAC CCC TCC CGG CCT GTG ACC TTT GTG AGC 
Ala His Thr Lys Ser Leu Asp Pro Ser Arg Pro Val Thr Phe Val Ser 
710 715 720 

AAC TCT AAC TAT GCA GCA GAC AAG GGG GCT CCG TAT GTG GAT GTG ATC 
Asn Ser Asn Tyr Ala Ala Asp Lys Gly Ala Pro Tyr Val Asp Val He 
725 730 735 

TGT TTG AAC AGC TAC TAC TCT TGG TAT CAC GAC TAC GGG CAC CTG GAG 
Cys Leu Asn Ser Tyr Tyr Ser Trp Tyr His Asp Tyr Gly His Leu Glu 
740 745 750 

TTG ATT CAG CTG CAG CTG GCC ACC CAG TTT GAG AAC TGG TAT AAG AAG 
Leu He Gin Leu Gin Leu Ala Thr Gin Phe Glu Asn Trp Tyr Lys Lys 
755 760 765 770 

TAT CAG AAG CCC ATT ATT CAG AGC GAG TAT GGA GCA GAA ACG ATT GCA 
Tyr Gin Lys Pro He He Gin Ser Glu Tyr Gly Ala Glu Thr He Ala 

775 780 785 

GGG TTT CAC CAG GAT CCA CCT CTG ATG TTC ACT GAA GAG TAC CAG AAA 
Gly Phe His Gin Asp Pro Pro Leu Met Phe Thr Glu Glu Tyr Gin Lys 
790 795 800 

AGT CTG CTA GAG CAG TAC CAT CTG GGT CTG GAT CAA AAA CGC AGA AAA 
Ser Leu Leu Glu Gin Tyr His Leu Gly Leu Asp Gin Lys Arg Arg Lys 
805 810 815 

TAT GTG GTT GGA GAG CTC ATT TGG AAT TTT GCC GAT TTC ATG ACT GAA 
Tyr Val Val Gly Glu Leu He Trp Asn Phe Ala Asp Phe Met Thr Glu 
820 825 830 

CAG TCA CCG ACG AGA GTG CTG GGG ATT AAA AAG GGG ATC TTC ACT CGG 
Gin Ser Pro Thr Arg Val Leu Gly Asn Lys Lys Gly He Phe Thr Arg 
835 840 845 850 



2513 



2561 



2609 



2657 



2705 



2753 



2801 



2849 



2897 



2945 



2993 



3041 



CAG AGA CAA CCA AAA AGT GCA GCG TTC CTT TTG CGA GAG AGA TAC TGG 3 0 89 

Gin Arg Gin Pro Lys Ser Ala Ala Phe Leu Leu Arg Glu Arg Tyr Trp 

855 860 865 

AAG ATT GCC AAT GAA ACC AGG TAT CCC CAC TCA GTA GCC AAG TCA CAA 3137 
Lys lie Ala Asn Glu Thr Arg Tyr Pro His Ser Val Ala Lys Ser Gin 
870 875 880 

TGT TTG GAA AAC AGC CCG TTT ACT TGAGCAAGAC TGATACCACC TGCGTGTCCC 3191 
Cys Leu Glu Asn Ser Pro Phe Thr 
885 890 

TTCCTCCCCG AGTCAGGGCG ACTTCCACAG CAGCAGAACA AGTGCCTCCT GGACTGTTCA 3 251 

CGGCAGACCA GAACGTTTCT GGCCTGGGTT TTGTGGTCAT CTATTCTAGC AGGGAACACT 3 311 

AAA 3314 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 909 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Gly Trp Ser Cys lie lie Leu Phe Leu Val Ala Thr Ala Thr Gly 
-19 -15 -10 -5 

Val His Ser Gin Val Gin Leu Gin Glu Ser Gly Pro Gly Leu Val Arg 
15 10 

Pro Ser Gin Thr Leu Ser Leu Thr Cys Thr Val Ser Gly Phe Thr lie 
15 20 25 

Ser Ser Gly Tyr Ser Trp His Trp Val Arg Gin Pro Pro Gly Arg Gly 
30 35 40 45 

Leu Glu Trp He Gly Tyr He Gin Tyr Ser Gly He Thr Asn Tyr Asn 

50 55 60 

Pro Ser Leu Lys Ser Arg Val Thr Met Leu Val Asp Thr Ser Lys Asn 

65 70 75 

Gin Phe Ser Leu Arg Leu Ser Ser Val Thr Ala Ala Asp Thr Ala Val 
80 85 90 

Tyr Tyr Cys Ala Arg Glu Asp Tyr Asp Tyr His Trp Tyr Phe Asp Val 
95 100 105 



Trp Gly Gin Gly Thr Thr Val Thr Val Ser Ser Gly Gly Gly Gly Ser 
110 115 120 125 

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Asp lie Gin Leu Thr Gin 

130 135 140 

Ser Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr lie Thr 
145 150 155 

Cys Ser Thr Ser Ser Ser Val Ser Tyr Met His Trp Tyr Gin Gin Lys 
160 165 170 

Pro Gly Lys Ala Pro Lys Leu Leu lie Tyr Ser Thr Ser Asn Leu Ala 
175 180 185 

Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe 
190 195 200 205 

Thr Phe Thr lie Ser Ser Leu Gin Pro Glu Asp lie Ala Thr Tyr Tyr 

210 215 220 

Cys His Gin Trp Ser Ser Tyr Pro Thr Phe Gly Gin Gly Thr Lys Leu 
225 230 235 

Glu He Lys Glu Leu Lys Thr Pro Leu Gly Asp Thr Thr His Thr Cys 
240 245 250 

Pro Arg Cys Pro Ala Ala Ala Ala Val Gin Gly Gly Met Leu Tyr Pro 
255 260 265 

Gin Glu Ser Pro Ser Arg Glu Cys Lys Glu Leu Asp Gly Leu Trp Ser 
270 275 280 285 

Phe Arg Ala Asp Phe Ser Asp Asn Arg Arg Arg Gly Phe Glu Glu Gin 

290 295 300 

Trp Tyr Arg Arg Pro Leu Trp Glu Ser Gly Pro Thr Val Asp Met Pro 
305 310 315 

Val Pro Ser Ser Phe Asn Asp He Ser Gin Asp Trp Arg Leu Arg His 
320 325 330 

Phe Val Gly Trp Val Trp Tyr Glu Arg Glu Val He Leu Pro Glu Arg 
335 340 345 

Trp Thr Gin Asp Leu Arg Thr Arg Val Val Leu Arg He Gly Ser Ala 
350 355 360 365 

His Ser Tyr Ala He Val Trp Val Asn Gly Val Asp Thr Leu Glu His 

370 375 380 

Glu Gly Gly Tyr Leu Pro Phe Glu Ala Asp He Ser Asn Leu Val Gin 
385 390 395 



Val Gly Pro Leu Pro Ser Arg Leu Arg lie Thr lie Ala lie Asn Asn 
400 405 410 

Thr Leu Thr Pro Thr Thr Leu Pro Pro Gly Thr lie Gin Tyr Leu Thr 
415 420 425 

Asp Thr Ser Lys Tyr Pro Lys Gly Tyr Phe Val Gin Asn Thr Tyr Phe 
430 435 440 445 

Asp Phe Phe Asn Tyr Ala Gly Leu Gin Arg Ser Val Leu Leu Tyr Thr 

450 455 460 

Thr Pro Thr Thr Tyr lie Asp Asp lie Thr Val Thr Thr Ser Val Glu 
465 470 475 

Gin Asp Ser Gly Leu Val Asn Tyr Gin lie Ser Val Lys Gly Ser Asn 
480 485 490 

Leu Phe Lys Leu Glu Val Arg Leu Leu Asp Ala Glu Asn Lys Val Val 
495 500 505 

Ala Asn Gly Thr Gly Thr Gin Gly Gin Leu Lys Val Pro Gly Val Ser 
510 515 520 525 

Leu Trp Trp Pro Tyr Leu Met His Glu Arg Pro Ala Tyr Leu Tyr Ser 

530 535 540 

Leu Glu Val Gin Leu Thr Ala Gin Thr Ser Leu Gly Pro Val Ser Asp 
545 550 555 

Phe Tyr Thr Leu Pro Val Gly lie Arg Thr Val Ala Val Thr Lys Ser 
560 565 570 

Gin Phe Leu lie Asn Gly Lys Pro Phe Tyr Phe His Gly Val Asn Lys 
575 580 585 

His Glu Asp Ala Asp lie Arg Gly Lys Gly Phe Asp Trp Pro Leu Leu 
590 595 600 605 

Val Lys Asp Phe Asn Leu Leu Arg Trp Leu Gly Ala Asn Ala Phe Arg 

610 615 620 

Thr Ser His Tyr Pro Tyr Ala Glu Glu Val Met Gin Met Cys Asp Arg 
625 630 635 

Tyr Gly lie Val Val lie Asp Glu Cys Pro Gly Val Gly Leu Ala Leu 
640 645 650 

Pro Gin Phe Phe Asn Asn Val Ser Leu His His His Met Gin Val Met 
655 660 665 

Glu Glu Val Val Arg Arg Asp Lys Asn His Pro Ala Val Val Met Trp 
670 675 680 685 



Ser Val Ala Asn Glu Pro Ala Ser His Leu Glu Ser Ala Gly Tyr Tyr 

690 695 700 

Leu Lys Met Val lie Ala His Thr Lys Ser Leu Asp Pro Ser Arg Pro 
705 710 715 

Val Thr Phe Val Ser Asn Ser Asn Tyr Ala Ala Asp Lys Gly Ala Pro 
720 725 730 

Tyr Val Asp Val lie Cys Leu Asn Ser Tyr Tyr Ser Trp Tyr His Asp 
735 740 745 

Tyr Gly His Leu Glu Leu lie Gin Leu Gin Leu Ala Thr Gin Phe Glu 

750 755 760 765 

Asn Trp Tyr Lys Lys Tyr Gin Lys Pro lie lie Gin Ser Glu Tyr Gly 

770 775 780 

Ala Glu Thr lie Ala Gly Phe His Gin Asp Pro Pro Leu Met Phe Thr 
785 790 795 

Glu Glu Tyr Gin Lys Ser Leu Leu Glu Gin Tyr His Leu Gly Leu Asp 
800 805 810 

Gin Lys Arg Arg Lys Tyr Val Val Gly Glu Leu lie Trp Asn Phe Ala 
815 820 825 

Asp Phe Met Thr Glu Gin Ser Pro Thr Arg Val Leu Gly Asn Lys Lys 
830 835 840 845 

Gly lie Phe Thr Arg Gin Arg Gin Pro Lys Ser Ala Ala Phe Leu Leu 

850 855 860 

Arg Glu Arg Tyr Trp Lys lie Ala Asn Glu Thr Arg Tyr Pro His Ser 
865 870 875 

Val Ala Lys Ser Gin Cys Leu Glu Asn Ser Pro Phe Thr 
880 885 890 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
ACCAGAAGCT TATGAATATG CAAATC 



26 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GCCACCCGAC CCACCACCGC CCGATCCACC GCCTCCTGAG GAGACGGTGA CCGTGGTC 58 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GGTGGATCGG GCGGTGGTGG GTCGGGTGGC GGCGGATCTG ACATCCAGCT GACCCAGAGC 60 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
TGCAGGATCC AACTGAGGAA GCAAAGTTTA AATTCTACTC ACCTTTGATC 50 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 





(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



TTTTTAAGCT TAGATCTCCA CCTTGGTC 



28 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
AAAAATCTAG AATGCAGGTC CAACTGCAGG AGAG 34 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
AAAAAAGTGA TCAAAGCGTC TGGCGGGCCA CAGGGCGGGA TCCTGTAC 48 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
TTTTAAGCTT CAAGTAAACG GGCTGTT 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TTTTGGTACC TTTGGATAAA AGACAGGTCC AACTGCAGGA GAG 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AAAACCATGG GAATTCAAGC TTCGAGCTGG TACTACAGGT 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TTTTAAGCTT CCATGGCGGC CGCTCATTGT TTGCCTCCCT GCTG 
(2) INFORMATION FOR SEQ ID NO: 14: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
AAAAAGATCT CCGCGTCTGG CGGGCCACAG TTACGTGTAG AAACCCCA 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TTTTGGTACC TTTGGATAAA AGACAGGTCC AACTGCAGGA GAG 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
AAAAAGCTTA GATCTCCAGC TTGGTCCC 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
AAAGAATTCT GATCAAATCC TCGAGCTCAG GTTCACAAAA GGTAGAGAAA ACAGT 55 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
TTTAAGCTTA TTTTAATAAA TCCAATGT 
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